Landmark-Based Pronunciation Error Identification on Chinese Learning
نویسندگان
چکیده
This paper explores a novel approach of identifying pronunciation errors for the second language (L2) learners based on the landmark theory of human speech perception. Earlier works on the selection method of distinctive features and the likelihoodbased “goodness of pronunciation” (GOP) measurement have gained progress in several L2 languages, e.g. Dutch and English. However, the improvement of performance is limited due to error-prone automatic speech recognition (ASR) systems and less distinguishable features. Landmark theory posits the existence of quantal nonlinearities in the articulatory-acoustic relationship, and provides a basis of selecting landmark positions that are suitable for identifying pronunciation errors. By leveraging this English acoustic landmark theory, we propose to select Mandarin Chinese salient phonetic landmarks for the Top-16 frequently mispronounced phonemes by Japanese (L1) learners, and extract features at those landmarks including mel-frequency cepstral coefficients (MFCC) and formants. Both cross validation and evaluation are performed for individual phonemes using support vector machine with linear kernel. Experiments illustrate that our landmark-based approaches achieve higher micro-average f1 score significantly than GOPbased methods.
منابع مشابه
Exploiting Bishun to Predict the Pronunciation of Chinese
Learning to pronounce Chinese characters is usually considered as a very hard part to foreigners to study Chinese. At beginning, Chinese learners must bear in mind thousands of Chinese characters, including their pronunciation, meanings, Bishun (order of strokes) etc., which is very time consuming and boring. In this paper, we proposed a novel method based on translation model to predict the Ch...
متن کاملTransfer Learning based Non-native Acoustic Modeling for Pronunciation Error Detection
The scarcity of large-scale non-native corpora and human annotations are two fundamental challenges in the development of computer-assisted pronunciation training (CAPT) systems. We explored several transfer learning based methods to detect the pronunciation errors without using nonnative training data. Effects were confirmed in the Mandarin Chinese pronunciation error detection of Japanese spe...
متن کاملArticulatory Modeling for Pronunciation Error Detection without Non-Native Training Data Based on DNN Transfer Learning
Aiming at detecting pronunciation errors produced by second language learners and providing corrective feedbacks related with articulation, we address effective articulatory models based on deep neural network (DNN). Articulatory attributes are defined for manner and place of articulation. In order to efficiently train these models of non-native speech without such data, which is difficult to c...
متن کاملThe Impact of Computer–Assisted Language Learning (CALL) /Web-Based Instruction on Improving EFL Learners’ Pronunciation Ability
The purpose of this study was to investigate the effect of CALL/Web-based instruction on improving EFL learners’ pronunciation ability. To this end, 85 students who were enrolled in a language institute in Rasht were selected as subjects. These students were given the Oxford Placement Test in order to validate their proficiency levels. They were then divided into two groups of 30 and were...
متن کاملRule-based Word Pronunciation Networks Generation for Mandarin Speech Recognition
Modeling pronunciation variation in spontaneous speech is very important for improving the recognition accuracy. One limitation of current recognition systems is their dictionaries for recognition only contain one standard pronunciation for each entry, so that the amount of variability that can be modeled is very limited. In this paper, we proposed to generate pronunciation networks based on ru...
متن کامل